SCALM: Towards Semantic Caching for Automated Chat Services with Large Language Models

Li, Jiaxing, Xu, Chi, Wang, Feng, von Riedemann, Isaac M, Zhang, Cong, Liu, Jiangchuan

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have become increasingly popular, transforming a wide range of applications across various domains. However, the real-world effectiveness of their query cache systems has not been thoroughly investigated. In this work, we conduct the first analysis of real-world human-to-LLM interaction data, identifying key challenges in existing caching solutions for LLM-based chat services. Our findings reveal that current caching methods fail to leverage semantic connections, leading to inefficient cache performance and extra token costs. To address these issues, we propose SCALM, a new cache architecture that emphasizes semantic analysis and identifies significant cache entries and patterns. We also detail the implementations of the corresponding cache storage and eviction strategies. Our evaluations show that SCALM increases cache hit ratios and reduces operational costs for LLM chat services. Compared with other state-of-the-art solutions in GPTCache, SCALM shows, on average, a relative increase of 63% in cache hit ratio and a relative improvement of 77% in token savings.
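The core idea behind semantic caching, as the abstract describes it, is to return a cached response when a new query is semantically close to a previously answered one, rather than requiring an exact string match. A minimal sketch of that lookup pattern is below; it is not SCALM's actual implementation, and the toy bag-of-words embedding stands in for the sentence encoder a real system (e.g., GPTCache) would use:

```python
import math
from collections import Counter

def embed(text):
    # Toy bag-of-words embedding; a real semantic cache would use
    # a neural sentence encoder instead.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    def __init__(self, threshold=0.7):
        self.threshold = threshold   # minimum similarity for a cache hit
        self.entries = []            # list of (embedding, query, response)

    def put(self, query, response):
        self.entries.append((embed(query), query, response))

    def get(self, query):
        # Linear scan for the most similar cached query; a production
        # cache would use an approximate nearest-neighbor index.
        q = embed(query)
        best_resp, best_sim = None, 0.0
        for emb, _, resp in self.entries:
            sim = cosine(q, emb)
            if sim > best_sim:
                best_resp, best_sim = resp, sim
        return best_resp if best_sim >= self.threshold else None

cache = SemanticCache(threshold=0.7)
cache.put("what is the capital of France", "Paris")
hit = cache.get("what is the capital of France ?")   # near-duplicate query hits
miss = cache.get("how do neural networks learn")     # unrelated query misses
```

The similarity threshold is the key tuning knob: too low and semantically different queries collide, too high and paraphrases miss the cache, which is the inefficiency the paper attributes to exact-match caching.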


The Big Data Market: A Data-Driven Analysis of Companies Using Hadoop, Spark, Data Science, and Machine Learning

#artificialintelligence

Aman's background is in the intersection of Business Applications and Artificial Intelligence, using both to drive the next generation of business applications. Aman also founded and worked in various startups in search, social, trading systems, and enterprise software. His last startup was TopCorner, a political platform for micro-lobbying. Aman was the architect for IBM SuperSell Enterprise and Oracle CRM. He was previously the Director of Special Projects for the CEO's office at Oracle. Aman earned an MS in Computer Science from Stanford, with research focused on natural language processing (NLP).